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In this paper we consider spatial networks that realize a balance between an infrastructure cost 
(the cost of wire needed to connect the network in space) and communication efficiency, measured 
by average shortest pathlength. A global optimization procedure yields network topologies in which 
this balance is optimized. These are compared with network topologies generated by a competitive 
process in which each node strives to optimize its own cost-communication balance. Three phases are 
observed in globally optimal configurations for different cost-communication trade-offs: (i) regular 
small worlds, (ii) star-like networks and (iii) trees with a centre of interconnected hubs. In the 
latter regime, i.e. for very expensive wire, power laws in the link length distributions P(w) oc w~ a 
are found, which can be explained by a hierarchical organization of the networks. In contrast, in 
the local optimization process the presence of sharp transitions between different network regimes 
depends on the dimension of the underlying space. Whereas for d = oo sharp transitions between 
fully connected networks, regular small worlds and highly cliquish periphery-core networks are found, 
for d — 1 sharp transitions are absent and the power law behaviour in the link length distribution 
persists over a much wider range of link cost parameters. The measured power law exponents are in 
agreement with the hypothesis that the locally optimized networks consist of multiple overlapping 
sub-optimal hierarchical trees. 

PACS numbers: 64.60.aq,89.75.Hc,89.75.Fb 
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I. INTRODUCTION 

In recent years much interest has been focused on char- 
acterizing complex systems as networks. Seemingly uni- 
versal characteristics have been revealed in analyses of 
many real-world systems such as power grids, the inter- 
net, metabolic and gene-regulatory networks and some 
social networks. Scale-free degree distributions |l| and 
a small- world character are the prime examples |2j. In 
the first case, many networks are found to have a degree 
distribution with a power law tail p(k) oc fc -7 with an 
exponent 7 in the range 2 < 7 < 3. The second case, 
small worlds, describe networks that combine a high de- 
gree of cliquishness (reminiscent of, for example, an un- 
derlying spatial organisation) and small average shortest 
pathlength that scale logarithmically with the system size 
as in random graphs. Recent developments in the field 
are summarized in [3[ . 

It is important to find mechanisms that explain the ob- 
served network topologies of real-world systems. Many 
such mechanisms have been suggested to date, rang- 
ing from assembly mechnisms such as preferential at- 
tachment and varieties thereof [3[ to optimization [J, Q ■ 
Whereas the first approach assumes that a network is 
gradually assembled by a piecewise addition of new nodes 
according to certain stochastic rules, the latter approach 
assumes that the existing system represents the endpoint 
of an evolution that maximized some fitness measure. 
This idea was first advanced in [4J , which discussed mech- 
anisms for the emergence of small-world networks. The 
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authors of the study considered a set of nodes distributed 
in space and investigated networks that minimize a com- 
bination of pathlength and cost of wire needed to connect 
the nodes, while keeping the number of links fixed. The 
paper also argued that power laws in the link length dis- 
tributions emerge from such an optimization principle, 
but no theoretical explanation was given. This idea was 
extended in [5|, which studied networks that optimize 
a trade-off between the cost of links and optimal com- 
munication measured by average shortest pathlength. If 
the first consideration dominates, optimal networks are 
star-like. If shortest pathlengths are more desirable than 
link economy, fully connected networks were found to 
be optimal. At a transition point between both regimes 
scale-free networks with an exponent a — 3 have been re- 
orted. Subsequent studies made use of the approach of 
, investigating optimal transport with congestion [y 
or subject to targetted attacks [8(, synchronization 

tor networks that give rise to stable linear dynamics 

In this paper we revisit the results of [J] and |5( and 
consider networks that realize a trade-off between cost 
of links and average shortest pathlength. However, un- 
like in |5[ , we do not measure the cost of links as simply 
proportional to the number of links. Instead, like [J] we 
investigate a set of nodes that are located in space and 
find the network that optimizes a trade-off between aver- 
age shortest pathlength and the cost of the amount 'wire' 
required to connect the nodes in space. In this way we 
introduce spatial constraints into the network optimiza- 
tion problem. Spatial constraints play an important role 
in the formation of various real-world networks. Like- 
wise, pressures to minimize the network distance shape 
communication networks, for which the physical layout 



of the internet is an important example. 



II. THE GLOBAL OPTIMUM 



Since it appears more natural that links can be formed 
and removed as demanded by resource availability, we 
also do not fix the number of links in the optimization 
as it was done in [4J. Thus, in our model multiple short 
links can be transformed into one long link, which turns 
out to be of importance below. In this way the model 
is able to capture both the 'connectivity' transition be- 
tween sparse and dense networks discussed in [5j , but also 
includes spatial constraints and a discussion of the spa- 
tial organization of the network arrangements becomes 
possible. In the first part of the study we undertake a 
detailed analysis of different regimes in the phase dia- 
gram of networks which realize varying trade-offs of cost 
of wire and communication. 

All optimization of a global property (like pathlength 
and wire in this case) in a distributed system assumes 
(i) the functioning of the elementary components as a 
whole such that individual components benefit from an 
improved global performance and (ii) the possibility to 
coordinate alterations in system structure such that lo- 
cal and global improvements can be aligned, similar to 
conflicts between local and global optima that are the 
subject of competitive Game Theory. Relaxing these 
requirements, in the second part of the paper we also 
consider a localized optimization approach, where each 
node strives to optimize its own (local) trade-off between 
cost of wire and the average pathlength to other nodes. 
This concept of competitive optimization of a network by 
distributed agents appears again relevant. For instance, 
infrastructure and communication networks are not al- 
ways formed according to a central planning scheme, but 
links are often established or removed when it appears 
profitable to a local actor. 

Analyzing the differences between the globally and lo- 
cally optimal solutions, we demonstrate that local opti- 
mization leads to a set of more 'realistic' network topolo- 
gies and a broadening of the parameter region where 
power laws in the link length distribution can be ob- 
served. Notably, the exponents of the power laws de- 
scribing the link length distributions obtained from the 
competitive optimization process are much smaller than 
observed for globally optimal networks, thus being close 
to some exponents reported for various real-world sys- 
tems. 

Link length distributions of a power law form have 
been reported in several studies before and seem to occur 
in contexts as varied as for the internet [14j , integrated 
circuits |15| . the human cortex |16l | and certain regions 
of the human brain [17| ■ For instance, for 2-dimensional 
systems Ref. fl41| re ports an exponent of —1 for the in- 
ternet and Ref. [l5| an exponent of —.75 for integrated 
circuits. Our study aims to contribute towards a better 
understanding of mechanisms that generate such power 
laws. 



Consider a set of N nodes with labels i = 0, ..., N — 1 
and spatial positions xq,-.-,Xn-i and let d(i,j) be the 
spatial distance between the nodes i and j. The nodes 
are assumed to be uniformly distributed in a space with 
periodic boundary conditions, albeit the procedure we 
discuss below can easily be adapted to non-uniformly 
distributed nodes in space, if required in specific appli- 
cations. For example in one dimension we have x% = Si, 
i = 0, ..., N — 1 and d(i,j) = mhi(\xi —Xj\, NS— \xt — Xj\) 
is a distance measure on that space, in which S defines 
the spacing of the grid. Without loss of generality we 
assume 6=1 for the following. Note, that this definition 
of a distance measure differs from that assumed in [j, 
where the spatial distance between nodes on a Id grid 
was defined by the spatial distance in 2d when the nodes 
are embedded on a circle. Advantages of our formulation 
are (i) that it appears more natural and does not require 
an embedding in a higher dimensional space and (ii) that 
its generalization into higher dimensions is obvious. 

In contrast to the spatial metric, the network distance 
between nodes i and j, l(i,j), is the minimum number of 
'hops' to get from i to j on the network. Thus, whereas 
the former distance measure is related to the cost of wire 
to establish connections between nodes, the latter is a 
measure for the efficiency of communication. 

Generally, the amount of wire required to connect 
nodes in some d-dimensional space will depend on the 
dimension and topology of that space. Maximum dis- 
tances in a d-dimensional space with periodic boundary 
conditions scale as d max = \/d/2N 1 / d , i.e. for fixed sys- 
tem size they decrease steeply with d. Since we mostly 
present numerical results (and the determination of av- 
erage shortest pathlengths is numerically expensive) and 
are also interested in the interplay of many spatial scales 
in network formation, we restrict the bulk of the paper 
to d = 1, where distances up to d max = N/2 can be ob- 
served and only briefly comment on d — 2 at the end of 
the paper. 

Formally, we consider the problem to find network con- 
figurations A = (aij) i .^.q that minimize an energy- like 
quantity E(X) 



E{\) =XW+(1-X)l, 



(1) 



where w = 



N(N-\) J2i <J a i] d ( i J) is tne (normalized) 
amount of wire needed to realize the network Ta given 
by its adjacency matrix A and 7 = N (£-i) ^ l(i>j) is 
the average shortest pathlength on the network r^ . The 
parameter A is a measure for the relative weight of the 
cost of wire and desirability of efficient communication 
in Eq. Q. 

To get a first handle on the problem, some limiting 
cases and particular network configurations are of in- 
terest, cf. also Fig. [T] for some illustrations. We first 
discuss some simple limiting cases in subsection III Al 
and then, in subsection III Bl we introduce and analyse 
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FIG. 1: Illustration of optimal networks in the order in which they appear when the trade-off parameter is increased from A = 
to A = 1. 



(a) EPT with k=2, v=1/2 




level 



level 1 

level 2 
level 3 



(b) EPT with k=2, v=l/3 
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FIG. 2: Examples of equi-partitioning trees (EPT's) of TV" = 
15 nodes embedded in a Id space without periodic boundary 
conditions: (a) an EPT with k = 2 and v - 1/2, (b) an EPT 
with k = 2 and v = 2/3. 



equi-partitioning trees (EPT), a particular class of trees 
that have power law distributed link length distributions. 
This class of networks is important, because it establishes 
a link between power law link length distributions and 
the hierarchical structure of networks. Having considered 
these particular network configurations this then allows 



us to study transitions in the optimal topologies when 
the trade-off parameter A is varied in subsection III CI We 
conclude the section by the presentation and analysis of 
numerical results for optimal networks in subsection lllDI 



A. Limiting cases: 



Complete graphs, stars and 
chains 



Some simple graph structures are of interest because 
they demarcate optimal solutions for the limiting situa- 
tions A = and A = 1. The first of these are complete 
graphs for which 7=1. Neglecting terms smaller than 
0(1) elementary calculations also yield w — N /4 + 1/4, 



£fuii(A) = A(A74-3/4) + l. 



(2) 



In passing we note that the link length distribution for 
the complete graph is uniform, i.e. all lengths occur with 
the same frequency. 

The second limiting configuration of interest are star- 
like networks, in which one hub is connected to all other 
nodes. A back of the envelope calculation gives 7=2 — 
2/N and W = 1/2(1 + 1/2V) for such networks. Since we 
are interested in large networks we again neglect terms 
smaller than 0(1) and obtain 



£star(A)=-3/2A + 2. 



(3) 



Note that the expression in Eq. ([3]) is independent of the 
system size N. As for the complete graph the link length 
distribution for the star configuration is uniform. 

Relevant as a limiting case are also regular grids, for 
instance a simple ring graph, for which only direct spa- 
tial neighbours are connected. Neglecting order 0(1/N) 
terms one then has 7 = 7V/4+1/4 while trivially w = 2/N. 
Hence 



E ri 



X {2/N - N/4 - 1/4) + JV/4 + 1/4. (4) 



The link length distribution of the ring is characterized 
by only one length scale which corresponds to the spacing 
of the grid. 



B. Equi-partitioning trees 

A further class of networks is of interest. These net- 
works are hierarchical trees, which partition the under- 
lying space into equal parts such that root nodes at a 
lower level of the hierarchy are always in the middle of 
the space occupied by the sub-trees at the higher levels 
of the hierarchy. Assuming that N = 2 k+1 — 1 a recursive 
construction is the following: 

1. Start with the interval I = [0,2 fe+1 - 1], i.e. the 
whole space. The node in the middle, i.e. at po- 
sition 2 fc — 1 is the root node. Proceed with step 
2 with both of the intervals I\ = [0, 2 k — 1) and 
I2 = (2 fe — 1,N] and connect the root node 2 k — 1 
to the root nodes for both intervals. 

2. Given the nodes belonging to an interval I = 
[711,712], if 77,1 = 772 consider 777 = 771 as the root 
node and stop the procedure. Otherwise, select the 
node in the middle of /, 777, which is assigned the 
role of root node. Proceed with step 2. for the in- 
tervals I\ = [771,777) and I2 — (777,772] and connect 
the root node m to the root nodes of I\ and Ii- 

Figure [5^ gives an illustration of such a tree and its em- 
bedding in the underlying space. If N 7^ 2 fe+1 — 1 the 
procedure can still be applied, resulting in imperfect trees 
that are not completely balanced. Above, we constructed 
trees by subdividing intervals into two equal parts, re- 
sulting in a graph where apart from the minimum and 
maximum level of the hierarchy all nodes have degree 3. 
Similarly, subdividing intervals into k parts will result in 
equi-partitioning trees with larger degree. These, how- 
ever, have very similar properties to the trees constructed 
with k = 2 so that we only consider the case of k = 2 in 
detail below. 

A simple calculation shows that for N = 2 k+1 — 1 the 
total amount of wire used in the construction of an EPT 
with k — 2 is k2 k such that for large N in leading or- 
der w = l/Nlog 2 N. Albeit a bit more elaborate (see 
appendix for details) the calculation of the average path- 
length is also straightforward and yields 



1 = l/2\og 2 N - 1 
for large N . Hence one has 



(5) 



E 



EPT 



(A) = A 



log 2 ^ 

N 



+ (l-A)(-l + l/21og 2 A). (6) 



It is interesting to note that the link length distribu- 
tion of EPT's follows an inverse power law P(w) oc w~ a 
with a — 2. This is verified by the observation that for 
k = 0, ..., log 2 N there are 2 k+1 links that establish con- 
nections between nodes at level k and nodes at level k+1. 



All of these links have equal length w = 2~( fe+1 'yV, such 
that P(w) oc w~ 2 . The relationship P(w) oc w~ 2 still 
holds for EPT's with k > 2, or also for random parti- 
tioning trees for which the new root nodes are chosen at 
random from the interval [711 , 772] in step 2 of the con- 
struction algorithm. This is seen from the self-similarity 
of the trees, i.e. when considering length scales of or- 
der w/k one essentially considers k copies of the orginal 
tree, i.e. the link length distribution P(w) of the tree has 
the scaling property P(w/k) oc k 2 P(w). The solutions 
to this scaling relationship are power laws P(w) oc w~ a 
with a = 2. 

Note, that above we have always embedded nodes in 
space in such a way that typical distances were scaled 
by a factor 1/k when proceeding to the next level of the 
hierarchy and multiplying the number of nodes by a fac- 
tor of k. Following this procedure the embedding of the 
nodes in space is dense, i.e. the tree includes every node 
in space. Trees that don't include all nodes in space can 
be constructed if one introduces a scaling relationship for 
length scales v smaller than 1/k when proceeding from 
one level of the hierarchy to the next, cf. Fig. [2Jd for an 
illustration of such a tree for k = 2 and v = 1/3. 

A simple construction principle for such EPTs with 
general k and v could be the following recursive proce- 
dure: 

1. Select a root node n and a length scale L and 
divide the interval [no — Lq/2, hq + Lq/2] into k 
parts assigning the nodes no — Lo/2 + Lo/(2n), tiq — 
Lq/2 + 3L /(2«), ..., 77 + L /2 - L /(2n) the role 
of root nodes for the next level of the hierarchy. 
Proceed with step 2 for each of the new root nodes 
and set the length scale to L\ = vL$. 

2. Stop the procedure if the length scale Lk is smaller 
than the spacing of the grid. Otherwise, given a 
root node 77fc and a length scale Lk divide the inter- 
val [77^ — Lfe/2, nk + Lk/2] into k parts assigning the 
nodes n k - L k /2 + L k /{2n), n k - L k /2 + 3L fe /(2«), 
..., 77fe + Lk/2 — Lk/(2n) the role of root nodes for 
the next level of the hierarchy. Proceed with step 
2 for each of the new root nodes and set the length 
scale to Lk+i = vLk- 

Even though for v < l/« the tree contains only part 
of the nodes, several such trees, possibly with different 
numbers of hierarchical levels, can be embedded in space 
and connected by a few links, such that the combined 
tree essentially still has the same link length distribu- 
tion as an EPT with v < 1/k. By constriction, the link 
length distribution of EPT's with general K and v obeys 
the scaling relationship P(yw) oc k/vP(w). Assuming a 
power law ansatz P(w) oc w~ a one obtains 



a = - logy k + 1, 

reproducing the special case of a 
we discussed above. 



(7) 
2 for k — \jv that 



Since the construction only affects the location of 
nodes in space, EPT's with v < l/« have the same aver- 
age pathlength as an EPT with k — 1/v. However, since 
for v < 1/k one has a(n, v) < a(n, 1/k) EPT's with 
v < 1/k require more wire than EPT's with v — 1/k, i.e. 
they are sub-optimal and one does not expect them to oc- 
cur when optimizing E(X) at a global level. Nevertheless, 
they give an interesting reference case for the case of the 
competitive optimization considered in section Mil 



C. Transitions 

The expressions in Eq.'s ©, ©, (0J and © already 
illuminate that a similar transition as the one described 
in Q between star-like and fully connected networks will 
occur in spatially constrained networks. Clearly, for A = 

one has £fuii < -Estar < -Eept < -Bring, i«e. the complete 
graph is optimal. 

When A becomes substantially larger than 0(1/ N) 
complete graphs lose competitiveness since the contri- 
bution of the cost of wire scales linearly with N. Then, 
out of our four example networks over a wide range of 
A-values star networks are the preferred configurations. 
Eventually, when the cost of wire becomes the domi- 
nant consideration for A = 1, one has -E s tar = 1/2, 
Eept ~ log 2 /N and E ring ~ 1/N, i.e. ring graphs are 
optimal. 

EPT's, which are larger than stars, but require less 
wire for their construction than stars, will become pre- 
ferred in the regime in between the star network regime 
and the regime very close to A = 1 in which the ring 
graphs are optimal. To see this, consider Ajy = 1 — 1/N. 
In leading order one then has -Eept(Ajv) oc l/N\og 2 N, 
-Estar(AAr) = 1/2 and -E r i ng = 1/4, i.e. EPT's outperform 
both star networks and ring graphs. 

These rough back of the envelope calculations thus 
sketch out a phase diagram with several possible transi- 
tions. For A <C 1/N optimal networks are fully connected 
and a transition towards star networks is expected for in- 
creasing A. Increasing the relative cost of wire further, for 

1 — A -c 1/JV a second transition from star-like networks 
towards ring graphs is expected. Our above arguments 
also show that at the transition, in a small parameter 
regime close to A = 1 hierarchical partitioning trees with 
power law link length distributions become optimal. 

Of course, the three discussed network topologies only 
stand for 'typical' network configurations that illustrate 
that transitions between homogeneous and heteroge- 
neous networks can be expected when increasing A, and 
a second transition from heterogeneous networks towards 
linear chains can be reckoned with when further increas- 
ing A close to 1. To explore the full 'phase diagram' be- 
tween both extremes, we proceed with a numerical sim- 
ulation to construct networks for < A < 1. 



D. Numerical results and analysis 

In the following construction of optimal networks we 
employ a numerical optimization scheme that operates 
via simulated annealing, similar to previous work, cf. [4| . 
The scheme is typically seeded with a fc-regular lattice 
with small k. At each iteration of the scheme a rewired 
network configuration that differs from the previous con- 
figuration either by (i) the addition of a link at a ran- 
domly selected link vacancy, (ii) the deletion of a ran- 
domly selected link, or (iii) the exchange of the endpoints 
of two randomly selected pairs of connected nodes is sug- 
gested. Then the energies of the original and rewired con- 
figurations, E and E' , are calculated. The rewired config- 
uration is accepted with probability q — 1 if E' < E and 
with probability q ex exp(—rt(E' — E)) otherwise. In the 
above t stands for the iteration number and the constant 
r determines the rate of cooling down in the annealing 
scheme. In the procedure connectedness of the networks 
is enforced and the formation of loops is prevented. 

It is worthwhile to note that in the above we only fix 
the system size and the constant A. The complete op- 
timal network structure, i.e. the number of links, the 
amount of wire required, and structural properties of 
the optimal network topologies 'emerge' from the guided 
rewiring scheme. 

To elucidate the dependence of optimal network con- 
figurations on the parameter A, we constructed ensem- 
bles of at least 100 optimal networks of N = 100 nodes 
for systematically varied values of A. To characterize 
the optimal topologies and their embedding in space, we 
measured the energies, cf. Eq. ([T]), the average amount 
of wire per node W/N, the average degree k = 2L/N, 
the degree variation a\ = ^(fc — W) 2 , the clustering 
coefficient C, the average length of a link w = W/L, 
maximum and minimum degrees fc max and fc m j n and the 
average shortest pathlength I. Figure 02 panels (a)-(h) 
gives an overview over these network statistics when the 
parameter A is changed and Fig. Q] schematically illus- 
trates typical optimal network configurations for different 
values of A. 

While the dependence of the energy E(X) on A in panel 

(a) of Fig. [3] is smooth, several network characteris- 
tics reveal discontinuities. Most prominently, in panel 

(b) of Fig. [3] one notes jumps in the amount of wire 
per node w = W/N at A = 0.21 and A = 0.80. The 
first point marks a connectivity transition from densely 
wired networks to sparsely wired star-like networks. Be- 
low A = 0.21 one finds close to regular highly cliquish 
long range worlds. Networks above A = 0.21 are charac- 
terized by the presence of a single central hub node, that 
connects to all other nodes. The long range links between 
the hub and the periphery nodes are complemented by 
a large number of short range links that connect near- 
est neighbours in space. Optimal networks are networks 
composed of a Ai-regular periphery network and an ad- 
ditional central hub node that links to all other nodes. 
The value of k above gives the number of short range 
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FIG. 3: Phase diagram showing the dependence of E on A for the global optimum (filled circles) and for the local optimum from 
a competitive individual optimization process (open circles). The arrow points to critical points which mark the transitions 
between different types of networks, see text. The panels show (a) the energy E(X), (b) the amount of wire per node, (c) the 
number of links per node (degree), (d) variation of the degree (normalized by the star configuration), (e) clustering coefficient, 
(f) average length of a link W/L, (g) maximum and minimum degrees for the global optimum configuration and (h) average 
pathlengths. Data are for N = 100 and represent averages over at least 100 networks for each value of A. 



connections of periphery nodes. As A is increased and 
wire becomes more and more expensive, more and more 
of these short range links are dropped, which leads to 
a series of transitions towards lower values of k, till fi- 
nally at A = 0.66 a transition from the k — 2 ('wagon 
wheel') network to a star configuration with k — oc- 
curs, cf. the drop of the clustering coefficient in panel (e) 
and the steep increase of the average link length in panel 
(f). The star persists until at A = 0.8 the long range con- 
nections required in the star become too expensive and 
the central hub is replaced by a collective of hub nodes 
of roughly similar degree. These hub nodes, which are 
homogenously distributed in space, serve as local centres 
and link to nodes closest to them in space. Initially for A 
close to the transition point A = 0.8 the hubs are tightly 
interlinked by long range links, which leads to a non- 
vanishing clustering coefficient: note the little 'hump' at 
A = 0.8 in panel (e). Excluding the short cycles between 
the hub nodes the remainder of the network is tree-like. 
Increasing A further, long range connections between hub 
nodes are thinned out and the number of hub nodes in- 
creases, while also the difference in the degrees of hub 
nodes and periphery nodes becomes smaller. Cycles in 
the networks become increasingly fewer and longer, even- 
tually leading to the appearance of trees. The organi- 
zation of these trees is hierarchical and very similar to 
EPT's. When A is extremely close to 1, such that the 
economy of wire dominates but short pathlengths still 
give a small contribution to Eq. ([1}, a ring graph is op- 
timal. Finally, at A = 1 a linear chain, in which nearest 
neighbours in space are connected, becomes optimal. 



In the light of the scale-free networks found in the 
sparsely connected regime in [5[ a closer investigation of 
the transitions appears of interest. It turns out that the 
transition from star-like to exponential and linear net- 
works when increasing A does not lead via a regime in 
which scale-free networks appear. The reason for this 
is found in the embedding in an underlying space: the 
central hub of the star networks is replaced by a collec- 
tive of hubs of equal degrees when increasing A above 
A = 0.8. When further increasing A the number of these 
hub nodes increases, but they are always found to be of 
roughly similar degree, thus causing a distinct peak in 
the degree distribution. 

The sparse regime is also of interest since Ref. [J| or- 
in the context of synchronization problems in space- Ref. 
Ills] reported power laws in the link size distribution when 
wire is expensive. Since the star configurations are asso- 
ciated with uniform link length distributions such power 
laws can only be found for A > 0.8. In this regime, 
however, the networks are essentially tree-like, such that 
comparisons to real-world networks as those of |14l - tl7j 
appear unrealistic. Nevertheless, for 1 — A <C 1 the data 
give some support for power law tails of the link length 
distributions, cf. Fig. |4]which plots the link length distri- 
bution for optimal networks evolved for A = 0.98. For all 
the cases we investigated we found exponents slightly be- 
low a — — 2, thus being much larger than reported in j^j, 
but in good agreement with the link length distributions 
expected for EPT's and partitioning trees, cf. subsection 
III Bl The difference between our findings and [4J may be 
attributable to the variability in the number of links in 



5, 




The case of d = oo 



FIG. 4: Link length distribution of optimal networks of N = 
100 nodes constructed for A = 0.98. The line indicates a 
power law P(w) oc w~ a with exponent a = 2. 



the problem investigated in our study. 



III. LOCAL OPTIMA 



By defining normalized node contributions to wire 
Wi = 1/(JV— 1) J^ • a,ijd(i,j) and pathlength Ij = 1/(N — 
1) ^ l{hj) a node contribution to the fitness or energy 
.E(A) (cf. Eq. (JTJ) can be introduced 



ej(A) = Xu>i + (1 - A)£» 



(8) 



Trivially, E(X) = 2/iV^ ei(X). However, would a set of 
nodes where each node individually strives to optimize 
its own wire-pathlength balance also reach the optimum 
global configuration that we discussed in the previous 
section? While this appears possible in the symmetrical 
regular grid phase, we argue below that it is impossible in 
the 'symmetry-broken' phases where hub and periphery 
nodes coexist. 

To construct networks where nodes optimize their indi- 
vidual wire-communication balances, we consider the fol- 
lowing process. After seeding the algorithm with a ring 
lattice, in each iteration a node, e.g. node i, is chosen 
at random. Then, either a link from this node to a ran- 
domly selected neighbour is removed or a link between 
the node and another randomly selected node is intro- 
duced. This link addition/removal process generates a 
rewired network configuration, which is accepted if it re- 
duces i's energy balance ej(A) and rejected otherwise. If 
the altered configuration is rejected we proceed with the 
previous configuration. The process is repeated for a suf- 
ficiently large number of iterations, such that the average 
of the number of links (over intermediate timescales) is 
stationary. In the process above, connectedness of the 
networks is ensured and the formation of loops is pre- 
vented. 



Let us first consider the case of an infinite-dimensional 
underlying space, for which (up to a scaling factor) we 
set d(i,j) — lVi,j, reproducing the problem considered 
in [5[ . Thus for d — oo Eq. © reduces to 



ei (A) = Xh/{N -!) + (!- A)7, 



(9) 



where fc; is the degree of node i and Ij the average shortest 
pathlength from node i to all other nodes as above. To 
understand the instability of the star configuration in a 
situation when individual nodes optimize their individual 
cost of link-communication balances, let us consider the 
'wagon- wheel' configuration discussed above in sectionllll 
cf. Fig. IDJc). First, consider the central hub node, which 
has degree /c n ub = N — 1 and average network distance 
/hub = 1. Thus 



Chub 



A. 



(10) 



In contrast, a periphery node has k pcr — 3 and, neglecting 
0(1/N) terms, average pathlength 1=2, which gives 



= A3/iV + 2(l-A). 



(11) 



While benefitting from a slightly improved communica- 
tion the hub node has to pay for almost all the wire! 
Thus, in any large system with N 3> 1, a hub node 
would strive to drop links to improve its own cost of links- 
communication balance. This argument makes it clear 
that the star configurations that were found to be opti- 
mal in a wide range of A parameters in [5| can generally 
not survive in a local optimization process as introduced 
above. 

Before proceeding, let us first briefly recap the depen- 
dence of typical network structures on the parameter A as 
explained in [»]. For large A, i.e. expensive links, a sparse 
regime is found. Initially, for A close to 1 networks are 
trees with exponential degree distributions. Decreasing 
A the degree distributions broaden and for some interme- 
diate values of A scale-free trees are observed. At some 
critical value of A an abrupt transition towards star-like 
topologies takes place. Finally, decreasing A further a sec- 
ond transition from the star-like networks towards fully 
connected networks is found. 

Figure [6] gives a summary of network statistics that 
illustrate changes in the structure of the competitively 
evolved networks when the parameter A is changed, while 
Fig. [S]presents a graphical illustration of typical optimal 
network topologies in the order in which they appear 
when A is increased from to 1. In Fig. [6l as a guide 
for the discussion of the evolved network structures, we 
consider the number of links per node L/N (panel a), 
variance of the degree sequence (panel b), average short- 
est pathlength (panel c) and the average clustering coeffi- 
cient (panel d). The degree variance has been normalized 
by the degree variance of a star network. 
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the relative degree variance, (c) the average shortest path- 
length, and (d) the relative clustering coefficient on the pa- 
rameter A for d — oo and N = 100. Note the transition points 
at A = 0.66 and A = 0.934. 



As in the globally optimized networks of [5j, sharp 
transitions between different network regimes are evi- 
dent. The first point of interest is a transition towards 



fully connected networks at around Ai = 0.66. A second 
transition, marked by an abrupt drop in the number of 
links accompanied by a steep increase in degree variance, 
distance and clustering coefficient, is found at approxi- 
mately A2 = 0.934. A third transition is characterised 
by another sharp drop in the number of links, sharp in- 
crease in the degree variance and clustering, but decrease 
in the average shortest pathlength at around A3 = 0.966. 
Thus, excluding the fully connected topology, three types 
of network configurations need to be discussed. 

In the first non-trivial regime, Ai < A < A2, the net- 
works have distinctly smaller than expected degree vari- 
ances. Other network statistics reveal no significant de- 
viation to random expectations, the networks thus being 
very close to random regular graphs. At the first glance 
these networks appear very similar to the entangled nets 
discussed in Q. However, in contrast to the latter they 
have small, but non-vanishing clustering coefficients. 

Above the second transition, A2 < A < A3 the per- 
mutation symmetry of the previous regime is broken and 
the degree distributions become bimodal. The optimal 
networks become periphery-core networks composed of a 
core of hub nodes and a periphery of low degree nodes. 
Such networks have, e.g., been discussed in the context 
of networks in the immune system, cf. [lj|, but they 
also naturally occur when optimizing networks for diam- 
eter while requiring robustness against targetted node 
removals [8( . The periphery-core organization essentially 



results from the instability of the star configuration in the 
competitive optimization procedure. When selected for 
optimization, hub nodes strive to eliminate parts of their 
connections. However, since being outnumbered by the 
many periphery nodes, hub nodes are not able to drop 
connections with the same frequency as other nodes at- 
tempt to connect to them. Furthermore, while striving to 
disconnect from periphery nodes, it is beneficial for hub 
nodes to maintain connections with other hub nodes since 
this reduces their respective average pathlengths. These 
assortative connections between the hub nodes and the 
desirability for periphery nodes to form connections to 
hub nodes, lead to a strongly cliquish network arrange- 
ment (cf. panel d). This finding appears of interest since 
it illustrates a novel mechanism for the appearance of 
clustering, i.e. via the competitive optimization of cost 
of links and communication balances of individual nodes. 
To our knowledge this has not been discussed in the lit- 
erature before. 

Further increasing the cost of wire, the number of links 
decreases steadily. Since periphery nodes have only few 
connections to the core, connections between core nodes 
are thinned out, which reduces their respective degrees 
and thus decreases the degree variance in the network 
(cf. panel Fig. [5}d). As most triangles in the net- 
work result from the tightly connected core, thinning 
out the core also reduces the clustering coefficient and 
leads to an increase in average shortest pathlcngth. At 
A = 0.966 there is a further transition towards another 
periphery-core organization, distinguished by a sharply 
reduced core size. As links becomes more and more ex- 
pensive, this core gradually reduces in size, a process that 
further increases the degree heterogeneity, contracts the 
networks and eventually reduces clustering as the num- 
ber of hub nodes decreases. Further link cost increases 
finally drive the networks towards a star configuration. 



B. The case of d = 1 

Consider again the 'wagon-wheel' topology introduced 
in section [TT] and let us compare the hub node and a 
periphery node farthest away from the hub. Neglect- 
ing 0(1/ N) contributions the energy balance for the hub 
node reads 



e hub = AiV/2 + (1 - A), 



whereas 



--per 



2 - 3/2A 



(12) 



(13) 



for the periphery nodes farthest away from the hub. In 
d = 1 the energy difference between hub and periphery 
nodes is thus larger than for d = 00 and it increases in 
proportion to the number of neighbours of the central 
hub. 

Comparing the cases of d = 00 discussed in the pre- 
vious subsection and the case of d — 1 of interest here, 



a relevant consideration is also the 'expense' a periphery 
node would incur to link to a hub node. Whereas for 
d = 00 this is just Aw = 1/(N — 1) (i.e. one link), in 
d = 1 it varies between Aw = 1/(N — 1) for a node that 
is a spatial nearest neighbour of a hub and Aw = 1/2 
for a node at the farthest possible distance from the hub. 
For an 'average' node one has Aw « 1/4. Thus, on av- 
erage in d = 1 the cost of hub formation is larger than 
for d — 00. Benefits from hub formation accrue from 
reduced pathlength, and are independent of the dimen- 
sion of the underlying space. It appears a reasonable 
estimate to use the difference in pathlength that results 
from directly linking to a hub node and linking to a node 
which is connected to a hub. Hence reductions in aver- 
age pathlength from linking to another node are at most 
of order 1. Since for large systems Ait; ~ 1/N < const, 
for d = 00 and (Aw) = const, in d = 1 this heuristic 
argument suggests why hub formation is suppressed for 
d=l. 

To gain a better understanding of the network evolu- 
tion process we have analyzed the dependence of typi- 
cal network configurations on the parameter A. Data for 
some averaged network statistics, compared to the global 
optimum, are displayed in the panels of Fig. [3J but see 
also Fig. [5] for a graphical illustration of the optimal net- 
work topologies in d = 1. Two main observations stand 
out. First, panel (a) of Fig. |3J clearly illustrates that 
there is a deviation between the energies E(X) obtained 
by global and local competitive optimization for large A, 
while both procedures reach similar energies for small A. 
Closer inspection shows that the point where differences 
start to appear indeed coincides with A c = 0.21, the tran- 
sition point at which the globally optimal configuration 
is realized by a single central hub. This is explained by 
our above argument that the central hub configuration is 
not stable in the competitive process. Second, all other 
network statistics show the absence of sharp transitions 
in the competitive optimization when varying A. This is 
again not surprising, as the global optima characterized 
by the existence of hub nodes are unstable in the com- 
petitive process. Rather, the competitive optimization 
generates densely connected regular graphs for low cost 
of wire and then gradually increases the variance in de- 
grees as wire becomes more expensive, cf. panel (d). As 
this happens, the amount of wire and number of links 
decrease gradually (panels (b) and (c)), also entailing 
gradual decreases in the average link length (panel (f)), 
clustering coefficient (panel (e)) and average pathlength 
(panel (h)). 

The suppression of hub formation for expensive wire 
has an important consequence: it also impedes the devel- 
opment of uniform length distributions associated with 
the star configurations. As a result, the power laws in 
the link length distributions (observed for very expen- 
sive wire in section [TTJ) persist over a much wider range of 
A parameters. Panels (a) and (b) of Fig. [7] illustrate the 
emergence of the power law behaviour for a large system 
of N = 900 nodes. Over several orders of magnitude the 
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FIG. 7: Distribution of link lengths P(w) for a system of 
N = 900. Distributions are for d - 1 and A = 0.95 (a), 
d = 1 and A = 0.5 (b), d = 2 and A = 0.95 (c), and d = 
2 and A = 0.9 (d). The straight lines indicate power laws 
P(w) oc w~ a with exponents a = 1.18, a — 1.25, a = 0.9 
and a = 1.0, respectively. The data represent averages over 5 
network configurations and are binned logarithmically. Note 
that for N = 900 the maximum distance on the lattice is 
dmax = 450 for d — 1 and d max ~ 21.2 for d — 2. 



which appears the most relevant case for many practical 
applications. 

Let us again follow the heuristics developed in the pre- 
vious subsection and consider the costs and benefits of 
hub formation. To link to a hub node, wiring costs Aw 
in d = 2 range between 1/(N - 1) and d max (2)/(N - 1) 
and are, on average, of order Aw ~ N~ 1 ' 2 . Thus, for 
a large enough system we observe that Aw < const. In 
fact, extending the above argument to arbitrary d, one 
has Aw ~ N 1 / d ~ 1 , suggesting that hub formation be- 
comes easier in higher dimensions. 

As already observed in the introduction, cases of d > 1 
are hard to access in numerical simulations, since the 
maximum distances on higher dimensional lattices de- 
crease strongly with the dimension for constant system 
size. For this reason, it is not possible to obtain reli- 
able numerical results for the full phase diagram for a 
reasonable system size. We have, however, observed hub 
formation in d = 2 for N = 900 in numerical simulations 
for A = 0.99. For smaller A, when hub formation is still 
suppressed, the power law behaviour in the link length 
distributions observed for d = 1 is recovered. Figure [7] 
(c) and (d) display data for two example situations. All 
cases we have investigated suggest a more gradual decay 
of the link length distributions in d — 2 than in d = 1, 
measured exponents being in the range a sw 0.9. ..1.0. 



tails of the link size distributions are very well described 
by power laws P(w) ex w~ a with exponents a = 1.18 
(for very expensive wire A = 0.95) and a = 1.25 (for 
intermediate cost of wire A = 0.5). In contrast to the ob- 
servations of Ref. [4|], which reported exponents a > 2, 
the competitive optimization thus also leads to a much 
less steep decay of link lengths. These results are also 
consistent with 18], who find that a threshold exponent 
a c = d + 1 in the power laws of link length distributions 
separates a small world from a long world regime. Apply- 
ing the results of [l8[ to the case of d — 1 , small worlds 
require a < 2, which is what we have observed in our 
simulations. 

The deviation of the power law exponents from a — 2 is 
also quite interesting when compared to the hierarchical 
partitioning trees introduced in subsection lll Bl Since ex- 
ponents a < 2 naturally occur for the (globally) subopti- 
mal trees with v < 1/k one may argue that the structure 
of the competitively evolved networks comprises multi- 
ple overlapping suboptimal EPT's. Changing the cost- 
pressure of the wire A would smoothly lead to different 
spatial arrangements of the EPT's, which agrees with the 
smooth increase of the power law exponent a observed 
in the numerical simulations when A is increased. 



C. The case of d = 2 

In this section we briefly comment on the link length 
distributions under competitive optimization in d = 2, 



IV. SUMMARY AND DISCUSSION 

In this paper we have considered networks that opti- 
mize a balance between the (infrastructure-) cost of links 
and a measure for communication, i.e. average shortest 
pathlengths. In contrast to previous studies like [J, Q, 
the number of links was not fixed and networks were em- 
bedded in space. Additionally, the analysis included not 
only the globally optimal solutions, but also competitive 
solutions, where individual nodes strive to optimize their 
local cost-communication balances. Our analysis reveals 
marked differences between both situations and we have 
analysed and discussed how these differences depend on 
the dimensionality of the underlying space. 

Depending on the cost of wire, in globally optimal net- 
works three phases can be distinguished: (i) fully con- 
nected graphs, (ii) (almost) regular small worlds, and (iii) 
star-like networks. Even though the location of the tran- 
sitions between these phases depends on the dimension, 
this general structure of the phase diagram holds for all d. 
In a small interval of A-parameters shortly before chains 
become optimal, tree-like networks with power law link 
length distributions were found. The structure of these 
trees is hierarchical and the exponent of the power laws 
in the link length distributions agrees well with results 
derived for hierarchical partitioning trees. 

In the competitive optimization process only the net- 
work structures that occur in phases (i) and (ii) were 
found to be stable and the structure of the phase dia- 
gram was found to change when changing the dimension 
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of the underlying space. 

For d = oo, increasing the cost of wire transitions be- 
tween a complete graph, regular small worlds and differ- 
ent types of periphery-core networks have been found. 
Hence, in the competitive optimization process, the 
regime of star-like networks discussed in Q is replaced by 
regimes of networks composed of a highly cliquish core 
consisting of hub nodes and a periphery composed of low 
degree nodes. When varying the cost of wire, sharp tran- 
sitions occur between different periphery-core arrange- 
ments, characterized by different core sizes. 

For d — 1 hub-formation is suppressed in the competi- 
tive optimization and thus sharp transitions between dif- 
ferent regimes are absent. In spite of this, we find power 
law link length distributions with exponents larger than 
observed for very expensive wire in the globally optimal 
networks. The parameter regime where such power laws 
persist is much larger than for the globally optimal net- 
works. Optimal networks in this regime are also not tree- 
like, but relatively densely connected. Power law link 
length distributions with exponents a < 2 hint at the 
presence of overlapping suboptimal hierarchical trees. 

We have argued that hub-formation again occurs for 
d > 1. The regime of trade-off parameters where star- 
like networks dominate grows with the dimension of the 
underlying space. Nevertheless, power laws in the link 
length distribution could be recovered in the regime (ii) 
of (almost) regular small worlds. The observed exponents 
were found to be in the range of exponents recently ob- 
served for systems embedded in 2-dimensional space, e.g., 
for the internet [lj] and for integrated circuits [la ]. 

As a last point, we observe that the regime of power 
law link length distributions is bounded in two regards. 
The first limitation is when the cost of wire becomes very 
cheap. In this case link length distributions become uni- 
form at longer length scales, while reflecting the details 
of the underlying space at short length scales. The sec- 
ond bound is hub-formation. Even though, in principle, 
hubs are unstable in the competitive optimization pro- 
cess, the formation of hubs can occur when a majority 
of other nodes attempt to link to prospective hubs faster 
than they can drop links when it is their turn to optimize 
their position in the network. This process results in the 
formation of periphery-core networks. Comparisons of 
the costs and benefits of hub formation indicate that it 
becomes more prevalent when increasing the dimension 
of the underlying space. 
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Appendix: Calculation of the average pathlength for 
EPT's 



node O and denote the two branches emerging from the 
root node by A and B. Correspondingly, denote the root 
nodes of the branches by Oa and Ob ■ Since N = 2 fc+2 — 1 
the whole tree consists of k+1 hierarchical levels, whereas 
the subtrees A and B have k levels each and are both 
comprised of Na = Ab = 2 k+1 — 1 nodes. 

We first consider the length of a path from a node at 
level k,A in subtree A to a node at level ks in subtree 
B. From level kA one needs kA steps to reach Oa, 2 
steps to get to branch B via Oa and O and a further ks 
steps to reach level ks in subtree B. Hence, the average 
pathlength from a node at level kA in subtree A to nodes 
in B is 



h A - 



2 fc+i _ 



1 k 

T -j ]T (2 + k A + k B )2 k 



fc B =o 



(2 + k A ) 



(k-l)2 k+1 +2 
2 k + 1 - 1 



(Al) 
(A.2) 



This allows to calculate the average pathlength be- 
tween nodes in A and nodes in B, which is 



; w; = ^J-y E 2kAl 

k A =0 

) 0-l)2 fc+1 + 2 
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(A.3) 
(A.4) 



One notes that Iab ~ 2k when 2 k+1 3> 1. Similarly, 
to reach a node at level kA in subtree A from O fc^ + 1 
steps are required. Thus, the average pathlength from O 
to A is 



l OA = ¥ ^ l Y.{l + k A )2 

k A =0 



k A 



= 1 



AB 



1. 



(A.5) 
(A.6) 



Making use of the symmetrical structure of the tree for 
the average pathlength of the EPT one thus has 



'EPT 



N(N - 1) 
N A + 3 
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41vaioa h ~ mb 



4A A + 2 



Iab — 
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(A.7) 
(A.8) 



For large trees N ^> 1 the above simplifies to 



Iept = 2 lo §2 N ~ !■ 



(A.9) 



To calculate the average pathlength for EPT's consider 
an EPT comprised of TV = 2 k+2 — 1 nodes with root 
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